Conjugates, Correlation and Quantum Mechanics 
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The Jordan structure of finite-dimensional quantum theory is derived, in a conspicuously easy 
way, from a few very simple postulates concerning abstract probabilistic models (defined by a set 
of basic measurements and a convex set of states) . The key assumption is that each system A can 
be paired with an isomorphic conjugate system, A, by means of a non-signaling bipartite state r\A 
perfectly and uniformly correlating each basic measurement on A with its counterpart on A. In the 
case of a quantum-mechanical system associated with a complex Hilbert space ~H, the conjugate 
system is that associated with the conjugate Hilbert space ft, and r\A corresponds to the maximally 
entangled state on "H. ® H- 



I. INTRODUCTION AND BACKGROUND 

This paper derives the Jordan structure of finite- 
dimensional quantum theory from a very lean set of pos- 
tulates, and in a conspicuously easy way — easy, at any 
rate, if one knows the Koecher-Vinberg Theorem, which 
relates cuclidcan Jordan algebras to homogeneous, self- 
dual cones . This brings us within hailing distance of 
orthodox finite-dimensional quantum theory: every eu- 
clidean Jordan algebra is a direct sum of self-adjoint parts 
of real, complex or quaternionic matrix algebras, spin fac- 
tors, and the exceptional jordan algebra of self-adjoint 
3x3 matrices over the octonions flOj ] . 

In fact, I present two slightly different results, both 
resting on the idea of a conjugate system. Here is a 
sketch. As is well known, any mixed state on a quantum- 
mechanical system is the marginal of a pure bipartite 
state on a composite of two copies of that system. This 
latter state perfectly correlates some pair of basic observ- 
ables on these two copies. Using not two copies of the 
same system but rather, a system and its conjugate sys- 
tem (associated with the conjugate of the first system's 
Hilbert space), the maximally mixed state arises as the 
marginal of a bipartite state — essentially, the maximally 
entangled state — perfectly correlating every observable 
with its conjugate. 

These correlational features can be abstracted. A 
finite-dimensional probabilistic model A is characterized 
by a set of basic measurements and a finite-dimensional 
convex set of states. From this data, one can construct, 
in a canonical way, a pair of ordered real vector space 
V(A) and E(A) < V(A)*, the former generated by A's 
states, and the latter by its basic measurement out- 
comes. Suppose that all basic measurements of A have 
a common number n of outcomes. Define a conjugate for 
A to be a model A, plus a fixed isomorphism ja '■ A ~ A 
and a fixed bipartite, non-signaling state t\a between A 



1 See section 2 for a brief explanation of these terms, and [lOj 
for an accessible proof of the Koecher-Vinberg Theorem. Given 
the use of such a powerful off-the-shelf mathematical tool, I am 
perhaps "using a canon to shoot a canary" . But this is the sport 
of kings; and anyway, it's a big canary. 



and A, such that, for every basic measurement outcome 
x of A, 7j.a(x, 7a(x)) = 1/n. Assume now that (i) A 
has a conjugate system, and, further, that (ii) every 
state of A dilates to a non-signaling bipartite state 
between A and A that perfectly correlates some pair of 
basic measurements. Further, suppose (iii) every basic 
measurement outcome has probability one in a unique 
state, and (iv) every non-singular state (every state 
strictly positive on basic measurement outcomes) can 
be prepared, up to normalization, from the maximally 
mixed state by means of a reversible process. 

Theorem 1 Subject to conditions (i)-(iii), E(vl) is 
self-dual, and isomorphic to V(A). Subject to condition 
(iv), V(A) is homogeneous. Thus, subject to conditions 
(i)-(iv), E(A) is homogeneous and self-dual, and hence, 
by the Koecher- Vinberg Theorem, has the structure of a 
euclidean Jordan algebra. 

In the presence of a conjugate, a slightly stronger 
preparability hypothesis than (iv) yields both the 
homogeneity and the self-duality of E(A)+, without the 
need for assumptions (ii) and (iii) above. A filter is a 
process <& (that is, a positive mapping V(A) — > V(A)) 
that independently attenuates the reliability of each 
outcome of some basic measurement, in the sense that 
for each outcome x, there is a constant t x £ (0, 1] with 
$>(a)(x) — t x a(x) for all states a. Suppose A has a 
conjugate, A: starting in the canonical state t]a, we can 
apply $ to the first component of the composite system 
AA to obtain a new bipartite state ($ ® 1a)(va) (where 
1-t is the identity transformation on A's state space). 
We can also apply the counterpart of $ to A, obtaining 
(1 <& §>)(t]a)- Call $ symmetric iff these states are the 



Theorem 2: Let A have a conjugate system A. If every 
interior state of A arises from the maximally mixed state 
by a symmetric reversible filter, then A satisfies (ii) and 
(iii); hence, E(A) + is homogeneous and self-dual. 

The proofs of both of these results are quite short and 
straightforward. Several of the ideas in this paper were 
earlier explored, and somewhat similar results derived, in 
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17 and [iq [. However, the approach taken here is much 
simpler and more direct. 



A. General Probabilistic Theories 

The general framework for this paper is that of [f| [I?} , 
which I'll now quickly review. In a few places (set 
off in numbered definitions), my usage differs slightly 
from that of the cited works. See @, [HI for further 
information on ordered vector spaces and on Jordan 
algebras. 

Probabilistic Models A probabilistic model is charac- 
terized by a set Ai{A) of basic measurements or tests, 
and a set tt(A) of states. Identifying each test with its 
outcome-sest, M(A) is simply a collection of non-empty 
sets (a test space, in the language of I'll write X(A) 
for the union of this collection, i.e., the space of all out- 
comes of all basic measurements. I understand a state 
to be an assignment of probabilities to measurement- 
outcomes, that is, a function a : X(A) — > [0, 1] such that 

Yt,x<zE a ( x ) = 1 for a11 tests E e M{A). To reflect the 
possibility of forming statistical mixtures, I also assume 
that il(A) is convex. 2 

By the dimension of a model A, I mean the affinc 
dimension of tt(A). In the simplest finite-dimensional 
classical model, M(A) consists of a single, finite test, 
and fi(A) is the simplex of all probability weights on 
that test. Of more immediate interest to us is the 
quantum model A(H) = (A4('H),£l('H)) associated with 
a complex Hilbert space 'H. The test space M.(JV) is the 
set of orthonormal bases of H; thus, the outcome-space 
X(H) is the set of unit vectors of H. The state space 
CliTi) consists of the quadratic forms associated with 
density operators on H, so that a state a E il('H) has 
the form a{x) = (W a x, x) for some density operator W a , 
and all unit vectors x 6 X(H). Real and quaternionic 
quantum models, corresponding to real or quaternionic 
Hilbert spaces, are defined in the same way. 

Remark: Not every physically accessible observable on 
a finite-dimensional quantum system is represented by 
an orthonormal basis. Rather, the general observable 
corresponds to a positive-operator-valued measure- 
ment. Similarly, for an arbitrary probabilistic model 
A, the test space M.(A) may, but need not, represent 
a complete catalogue of all possible measurements 
one might make on the system represented by A: 
rather, it is some convenient catalogue of such measure- 
ments, sufficiently large to determine the system's states. 



The spaces E(A) and V(A) An ordered vector space is 
a real vector space E equipped with a convex cone E + 
with E + n -E+ = {0} and E = E + - E + — that is, 
E is spanned by E+. The cone induces a partial order, 
invariant under translation and multiplication by non- 
negative scalars, given by a < 6 iff & — a E E+. An 
example is the space hhifi) of hermitian operators on 
a real, complex or quaternionic Hilbert space, ordered 
by the cone of positive semi-definite operators. A linear 
mapping T : E — > F between ordered vector spaces E and 
F is positive iff T(E+) C F + . The dual cone, W + C E*, 
consists of positive linear functionals /EE*. 

Any probabilistic model A gives rise in a canon- 
ical way to a pair ordered vector spaces E(A) and 
V(A). The latter is simply the span of the state space 
fl(A) in the space K^" 4 ', ordered by the cone V(A)+ 
of non-negative multiples of states. Every outcome 
x E X(A) corresponds to a linear evaluation functional 
x : V(A) -> R, given by x(a) = a(x) for all a E V(A). 
The space E(A) is the span of these functionals in 
V(A)*, ordered by the cone E(A)+ consisting of all 
finite linear combinations J^. tixi having non-negative 
coefficients U. Note that a E E(A)+ implies a(a) > for 
all a E V(A)+. The converse is generally false. Note, 
too, that there is a distinguished vector ua E E(A) + 
given by ua = J2 X £E x 'i this is independent of the choice 
of E e M{A). Any state a £ fl(A) satisfies ua(ol) = 1; 
again, the converse is generally false. 3 

Processes A physical process on a system A is natu- 
rally represented by an affine (convex-linear) mapping 
T : tt(A) -> V(A) such that, for every a E tt(A), 
T(a) = p/3 for some /3 E fl(A) and some constant 
< p < 1 (depending on a), which we can regard as the 
probability that the process occurs, given that the initial 
state is a. Such a mapping extends uniquely to a positive 
linear mapping T : V(A) -4 V(A) with T(a)(u A ) < 1 for 
all a E n{A). I will say that a process T : V(A) ->■ V(B) 
prepares a state a of A from another state, /?, if a is a 
multiple of T(/3), i.e., T(/3) coincides with a up to nor- 
malization. 

A process T : V(A) — > V(B) has a dual action on 
V(A)*, given by T*(/) = foT for all / E V(A)*, with 
T*(u) < u. In our finite-dimensional setting, we can 
identify ~V(A)* with E(A) as vector spaces, but not, gen- 
erally, as ordered vector spaces. While $* will preserve 
the dual cone V(v4)*l, it is not required that T* pre- 
serve the cone E(v4) < V(A)*. This reflects the idea 
that not every physically accessible measurement need 
appear among the tests in A4(A), as discussed above. 

A process T : ~V(A) —> ~V(A) reversible iff there is 



2 It is usually also assumed that Q(A) is closed (hence, compact) 
in the product topology on M. x . This assumption isn't needed 
here, however. 



3 A model is said to be state- complete iff every a 6 V(A)+ with 
ua(c<) = 1 belongs to Q(A). State-completeness is frequently, 
though often tacitly, assumed in much of the recent literature on 
generalized probabilistic theories, including many of my earlier 
papers, but is not assumed here. 
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another process, S, such that, for every state a, there 
exists a constant p G (0,1] with S(T(a)) = pa. In 
other words, S allows us to recover a from T(a), up 
to normalization. It is not hard to see that p must be 
independent of a, so that 5" = pT^ 1 . In particular, T is 
an order-isomorphism of V(A). 

Self-Duality and Jordan Algebras For both classi- 
cal and quantum models, the ordered spaces E(A) and 
V(A) are isomorphic. In the former case, where A4(A) 
consists of a single test E and fi(A) is the simplex of 
all probability weights on E, we have V(A) ~ R E and 
E(A) ~ (R E )*. The standard inner product on R E pro- 
vides an order-isomorphism between these spaces, that 
is, a linear bijection taking the positive cone of one space 
onto that of the other. If H is a finite-dimensional real 
or complex Hilbert space, we have an affine isomorphism 
between the state space of fl{H) and the set of density 
operators on W, allowing us to identify ~V(A(H)) with 
Ch{H). For any x G X(H), the evaluation functional 
x G V(A) is then given by W M- {Wx, x) = Tr(WP x ). It 
follows that E(A(7*)) ~ C h {H)* ~ C h (H), with the lat- 
ter isomorphism implemented by the tracial inner prod- 
uct (a, b) = Tt(ab). 

More generally, any inner product (, ) on an ordered 
vector space E defines a positive linear mapping E — > E* . 
If this is an order-isomorphism, one says that E is self- 
dual with respect to this inner product . This is equivalent 
to the condition a 6 E + iff (a, b) > for all b G E + . In 
this language, the standard inner product on M. E and the 
tracial inner product on L/ l ('H) are self-dualizing, where 
E is a finite set and f£, a finite-dimensional Hilbert space. 

In fact, any euclidean Jordan algebra, ordered by its 
cone of squares, is self-dual with respect to its canonical 
tracial inner product. Another property shared by all 
euclidean Jordan algebras is homogeneity: the group 
of order-automorphisms of E acts transitively on the 
interior of the positive cone E+. The Koecher-Vinberg 
Theorem [lj| states that, conversely, any self-dual, ho- 
mogenous order-unit space E can be equipped with the 
structure of a euclidean Jordan algebra. This structure 
is unique, up to the choice of an element of the interior 
of the cone E + to serve as a unit for the Jordan algebra. 

Definition 1: Let us say that a model A is self-dual 
iff E(A) is self-dual and E(A)+ ~ V(A)+. Call A 
homogeneous iff V(A) is homogeneous. 

If the model A is both homogeneous and self-dual, 
then E(A) is homogeneous and self-dual, and hence, 
E(A) + is the cone of squares of a euclidean Jordan 
algebra. Of these two properties, homogeneity is the 
easier to interpret in physical terms: it amounts to 
assumption (iv) in the introduction, namely, that every 
state in the interior of V(A)+ can be obtained, up 
to normalization, from some particular such state by 
means of a reversible process. Self-duality is less easily 
motivated, but will emerge from the other assumptions 



sketched in the introduction. 

Composite Systems A composite of two probabilistic 
models A and B is a model AB, together with a mapping 
X{A) x X{B) — > X{AB) allowing us to interpret a pair 
(x, y) of outcomes belonging to the two systems sepa- 
rately as a single product outcome xy G X(AB), in such 
a way that, for any tests E G M(A) and F G M(B), the 
set EF = {xy\x G E, y G F} is a test in AB. It follows 
that any state uj G ft(AB) pulls back to a function 
which I'll also denote by u — on X(A) x X(B), given by 
uj(x,y) = u(xy), satisfying ^xsb.j/GF u (xy) = 1 for all 
tests E G M(A), F G M(B). One understands w(x,y) 
as the joint probability of the outcomes x and y in the 
bipartite state uj. 

Remark: This definition is weaker than that used in, 
e.g., [||, where it is also assumed that, for any pair of 
states a G ft (A) and j3 G ft(B), there exists a unique 
state a®/? on AB such that (a 55 (3)(x,y) — a(x)/3(y) 
for all outcomes x G X(A), y G X(B). This assumption 
is not needed in what follows. 

Non-Signaling Composites A state w on a composite 
AB is non-signaling iff the marginal states u>x(x) = ~^2 yeF 
and u>2{y) = X^e-E^ 2 "^) are wc U-defincd, i.e., indepen- 
dent of the choice of tests E and F. One can then also 
define conditional states u)2\x(y) '■= w(x,y)/u>i(x) (with, 
say, UJ2\ X identically zero if u>i(x) — 0), and similarly for 
^i\y This gives us the following bipartite version of the 
law of total probability [ll| : 

cj 2 = uji(x)uj 2 \x and = ^^2(2/)^% (1) 

xGE yEF 

for any choice of E G M(A) or F G M{B). 

Classical and quantum bipartite states are clearly 
non-signaling. 

Definition 2: A non- signaling composite of A and B, 
I mean a composite AB such that every state ui of AB 
is non-signaling, and the conditional states Wju and 
ll>2\x ar c valid states of A and B, respectively, for all 
outcomes y G X(A) and x G X(A). 

As an example, if H and K, arc real or complex Hilbert 
spaces, there is a natural mapping X(H) x X(K) — > 
X(7i ® 7C), namely (x, y) ^ x ® y. It is straightfor- 
ward to check that this makes A{Ji ® K) a non-signaling 
composite of A(H) and A(7C), in the above sense. 

It follows from (1) that if oj G S!(A_B) is non-signaling, 
the marginal states u\ and W2 also belong to f2(A) and 
f2(5), respectively. It is not hard to show that a state ui 
on a non-signaling composite AB gives rise to a bilinear 
form w : E(A) xE(B) — > K, uniquely defined by y) = 
u>(xy) for all outcomes a; G X(A),y G -Y(i3), and hence 
also to a positive linear conditioning map 

Q : E(A) V(B) 
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given by u(a)(b) — u)(a, b). Note that for any x G X(A), 
lo{x) = loi(x)lo2\ x , whence the terminology. If uj is an 
order-isomorphism, then uj is called an isomorphism 
state. 

Probabilistic Theories As a rule, one wants to think 
of physical theories, not as loosely structured classes, but 
rather, as categories of systems [H, H[. In what follows, 
a probabistic theory is understood to be a category, of 
probabilistic models, with morphisms corresponding to 
processes — that is, if A and B are models belonging 
to the theory, then morphism A — >• B are positive linear 
mappings T : V(A) — > V(B), subject to the condition 
that T{o)(ub) < 1 for all a G Q(A), as discussed above. 
A monoidal probabilistic theory is one closed under some 
definite operation A, B t— > AB of forming non-signaling 
composites, and subject also to the further condition 
that, for all models A\,A2,B\ and B2 G C and all pro- 
cesses T x : V(Ai) -> V(B X ) and T 2 : V(A 2 ) -> V(5 2 ), 
there is process T X ®T 2 : V(AiA 2 ) -> V(BiB 2 ) such that 

(Ti®T 2 )(u/)(z,y) =w(T*x ) T 2 *j/) (2) 

for all x G A(Ai) and y G A(A 2 ). 

The category of quantum models and completely 
positive mappings is a monoidal probabilistic theory in 
this sense, as is the category of real quantum models 
and completely positive mappings. It will simplify the 
discussion to assume, in the balance of this paper, 
that we are working in some monoidal non-signaling 
probabilistic theory, so that we can take advantage of (2). 



II. CORRELATIONAL PROPERTIES OF 
QUANTUM COMPOSITES 

In this section, H 7 7C are finite-dimensional real or 
complex Hilbcrt spaces (with inner products conjugate- 
linear in the second argument, in the complex case). 
The space of linear operators on 7i is C(H); the space 
of Hcrmitian operators, Ch{H). As discussed above, if 
A(H) is the corresponding quantum probabilistic model 
(with X{H) the set of unit vectors of H, M(H), the 
set of orthonormal bases of H, and il(H), the set of 
states associated with density operators on 7i), then 
E("H) |= E(A(H)) is canonically isomorphic to_Ch{H)- 

Let H denote fi s conjugate space (with 7i = 7i if 
H is real). If x G T-L 1 write x for the same vector in H, 
so that, for any scalar c, cx = cx, and ex = e x. No te 
that inner product on H is given by (x,y) = (x,y) = 
(y, x). If T G £{H), write T for the operator Tx = Tx. 
While T 1— >• T is conjugate linear as a mapping from 
C{T-L) to C{T-L), it is linear as a mapping between the 
real vector spaces E(H) = C h (U) and E(H) = C h ^H). 
Indeed, T M- T defines an order-isomorphism between 
these spaces. 

For any vectors x,y G H, let x y denote the rank- 
one operator on H given by (xQy)z = (z, y)x. (In Dirac 



notation, this is If a; is a unit vector, then xQx = 

P x , the orthogonal projection operator associated with x. 
The mapping x,y h-> x(Dy is sesquilincar, that is, linear in 
its first, and conjugate linear in its second, argument; it 
therefore extends uniquely to a linear mapping T-L®H — > 
C(H). It is easy to check that this is inective, and hence, 
on dimensional grounds, an isomorphism. 

Suppose now that a is a state on A(H), represented 
by a density operator W on H, so that a{x) = (Wx,x) 
for all unit vectors x G X(H). Let W have spectral 
resolution 

xeE xeE 

where E is an orthonormal basis for H (so that \ x = a(x) 

for all x € E). Functional calculus gives us W 1 / 2 = 

^ 1 / 2 

J2x<=E Xx xQx. Using the isomorphism C\H) ^ 'H®'H, 

we can interpret W 1 ' 2 as a vector vw & H, namely 
v w := Xl^x ® x. (3) 

x<£E 

This is a unit vector, and so, in turn, defines a pure 
bipartite state uj on the composite quantum system 
A~A := A(H®TL), with u(y,z) = \{y (g) z, v w )\ 2 . for all 
y,z € X(H). The marginal, or reduced, state ui\ on first 
component system assigns to an effect a a probability 

wi(a) — ((a ® l^)vw , vw) 

l£E y£E 

= ^2 X x(ax,x) 
xeE 

= J2 X xT^(Pxa) = Tr{Wa). 

X 

In other words, <Ji = a. A similar computation gives us 
u)2 = ct, the state on A{H) corresponding to W . Indeed, 
the state ui is symmetric, in the sense that uj(x,y) = 
uj(y,x) for all x, y G X{H). 

Notice that, by (3), we have |(x ® x, vw)\ 2 = oc(x) for 
every x G E. Evidently, the pure state ui corresponding 
to v\y sets up a perfect correlation between E G M(7i) 
and the corresponding test E = {x\x G E} G M{H). 
This works for any basis E diagonalizing W, i.e., uj 
correlates every such basis with the corresponding basis 
E for H. Indeed, if U is a unitary operator on H 
with UW = WU, then uj{Ux,Uy) = oj(x,y) for all 
x,y£X(H). 

Definition 3: A non-signaling state uj on a composite 
AB is correlating iff there exists some pair of tests 
E G M{A) and F G M{B) and a bijection / : E -> F 
such that oj(x, y) = for (x,y) G E X F with y ^ f(x) 
- equivalently, J^xee w ( x > f( x )) = L 

Using this language, we can summarize the forego- 
ing discussion as follows: every state a on a quantum 
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model A(H) is the marginal of a pure, correlating state 
on A(Ti ® TV), invariant under the group of unitaries of 
the form U ® U with U stabilizing a. 4 In the special 
case where a is the maximally mixed state, i.e., where 
W = 1/n, where n = dim('H)) and 1 is the identity 
operator on T-L, we the corresponding vector from (2) 
is vw = ~^^2 X £E X ® x =: ^> the maximally entangled 
stateSince W = 1/n is diagonalizcd by every orthonor- 
mal basis, the expansion above is valid for all E G M {TV}. 
Let's denote the corresponding non-signaling state on 
E(Ti®Ti) by t)a, so that rj A (a, b) = |(*,a® b)\ 2 . This 
state perfectly correlates every test E G M{Ti) with its 
counterpart in M. (H), and the correlation is uniform, in 
that t]a(x,x) = 1/n for all correlated pairs of outcomes 
x and x. If fact, It is not hard to see that, for arbitrary 
effects a, b G E(^4), we have r]A(a,b) = Tr(a6). In other 
words, the maximally entangled state onTifEiTi (rather 
than on Ti ® T-L) provides a direct operational interpre- 
tation of the tracial inner product on F,(H) = Lh(Ti). 

Suppose now that uj is any non-signaling state on 
the composite quantum system associated with any two 
Hilbert spaces Ha and Hb- Since the spaces E(A) and 
V(B) are canonically isomorphic to the spaces Ch^Ha) 
and Ch{H-B) of Hcrmitian operators on Ha and He, 
we can regard the conditioning map uj : E(A) — > E(£>) 
associated with u> as a mapping Cu{T-La) £}i(Hb)- 
The following is well known, and straightforward to 
verify: 

Lemma 1: Let W be any density operator on T-L, and let 
lu be the pure state on A(Ti ® TV) corresponding to Vw , 
as given by (2) and (3) above. Then the conditioning 
map corresponding to uj is given by u)(a) ~ W 1 / 2 aW 1 / 2 
for allae E(H). 

Corollary 1: Let W and uj be as in Lemma 1. If W is 
non-singular, then u) is an isomorphism state. 

Proof: If W is non-singular, so is W 1 ' 2 , with inverse 
given by W~ x ^ 2 - again, a positive operator. So 

Q : £(U) -> £(H) given by Q(a) = W 1/2 aW 1/2 is 
invertible, with inverse w" 1 : a \-> W~ 1 ^ 2 aW~ 1 ^ 2 - 
again, a positive mapping. □ 

Remark: Quaternionic Hilbert spaces present special 
problems, owing to the difficulty of defining a tensor 
product over a non-commutative division ring. However 
[3|, one can view a quaternionic Hilbct space as a pair 
(H, J) where H is a complex Hilbert space and J is an 
anti- unitary operator on T-L satisfying J 2 = —1. Like- 
wise, a real Hilbert spaces can be identified with com- 
plex Hilbert spaces equipped with anti-unitary operator 
J satisfying J = 1. Given two quaternionic Hilbert 



4 It is also true, by the Schmidt decomposition, that every pure 
state on H ® H is correlating. 



spaces {Hi, J\), a natural candidate for the tensor prod- 
uct of two pairs (Hi,Ji) with Ji anti- unitary and satis- 
fying J 2 = -1, is (7^10^2, JiOJa)- Since Jf = -1, wc 
have (Ji ® J2) 2 = 1, i.e., the tensor product should be 
thought of as a real Hilbert space. Understanding com- 
posites of quaternionic quantum systems in this way, with 
a little care one can show that QM over H still enjoys 
the correlational features just disucusscd. In other words, 
these features are equally consistent with real, complex 
and quaternionic quantum mechanics. 



III. CONJUGATE SYSTEMS 

We've seen that the composite quantum system 
A(H <S> H) has some very strong correlational properties. 
As I'll now show, one can derive the Jordan struc- 
ture of finite-dimensional QM from these properties, 
with a minimum of fuss. As a first step, we need to 
generalize the relationship between A(H) and A(H). 
Throughout this section, let A is a model of uniform 
rank n, meaning that all tests have cardinality n. By an 
isomorphism between two models A and B, I mean a 
bijection : X(A) X{B) such that <j>(E) G M(B) iff 
E G M{A), and /3 o g Cl(A) iff f3 € fl(B). 

Definition 4: A conjugate for A is a triple (A,-fA,VA) 
where ja '■ A ~ A is an isomorphism and t]a is a 
non-signaling state on (some composite of) A and 
A such that (a) r\{x,^ A {v)) = n{y,^A{x)) and (b) 
n A (x,jA(x)) = l/n for every 1,1/6 X(A). 5 

In the context of finite-dimensional quantum mechan- 
ics, where A = A('H) for a Hilbert space H, we can take 
A = A(W), with 7a (x) = x; for the state tja, we can 
use the standard maximally entangled state on H <£) T-L. 
Thus, we can think of t\a for an arbitrary probabilistic 
model A, as a generalized maximally entangled state. 

Remark: One might wonder whether one can use the 
isomorphism 7^4 to simply identify A with its conjugate. 
Certainly we can use ja to pull the correlator tja on 
A A back to a positive bilinear form on E(A) x E(A), 
namely rj' A {a,b) = riA{o-,lA{o))- However, whether 
this corresponds to a legitimate bipartite state on a 
legitimate composite AA of A with itself, depends on the 
particular probabilistic theory at hand. For example, 
if A = A(Ti) is the quantum model associated with a 
Hilbert space H, and A is the model associated with Tt 
in the usual way, then the state t\a pulls back along the 



5 The requirement that t\a be symmetric is mainly a convenience: 
given any uniformly perfectly correlating bipartite state 77 on 
AA, i.e., any state satisfying r){x, ~fA{ x )) = i/ n f° r all x € X(A), 
the state if defined by rf {x, 7(3/)) = r)(y,-/{x)) is also perfectly 
uniformly correlating, whence, so is the symmetic state (77 + 
rf)/2. 
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isomorphism a H> a to a bilinear form on Ch{H) x Ch(H) 
- but one associated with the non-completely positive 
partial-transpose operation, which corresponds to no 
state on A(Ji (g) T-L). Thus, the notion of a conjugate 
system is best understood as applying to an entire 
probabilistic theory, rather than to a single probabilistic 
model. 

Four Axioms We can now restate the four conditions 
discussed in the introduction in more precise terms: 

Axiom 1 (Conjugates): A has a conjugate, A. 

Axiom 2 (Correlation): Every state of A is the marginal 
of a corelating state on A A. 

Axiom 3 (Sharpness): For every outcome x G X(A), 
there exists a unique state 5 X G fl(A) such that S x (x) = 1. 

Axiom 4 (State Preparation): Every state a with 
a{x) > for all outcomes x, can be prepared by a 
reversible process from the maximally mixed state 
p(x) = 1/n. 

Axioms 1-3 arc trivially satisfied in discrete classical 
probability theory As observed in Section 2 they are also 
satisfied in finite-dimensional quantum theory. Axiom 4 
is equivalent to the homogeneity of the cone \ r (A) + , and 
hence, also satisfied by classical and quantum probability 
theory. 

We are now ready to prove Theorem 1. The main 
order of business is to show that a model satisfying 
Axioms 1, 2 and 3 is self-dual. The proof is not difficult. 
I'll break it up into a sequence of even easier lemmas. 
In the interest of readability, in what follows I conflate 
outcomes x £ X(A) with the corresponding effects 
x G E(A), and write x for ja(x). 

Lemma 2: For every a G V + (A), there exists a test E 
such that 

a = ^2 a(x)8 x (4) 

xEE 

Proof: We can assue that a is a normalized state. Then, 
by Axiom 2, a = u)i where u correlates some pair of tests 
E G M(A) with F eM(A) along a bijection / : E -> F. 
By the law of total probability (1) for non-signaling 
states, a = Y^yeF^ffl^Mv- Since u is correlating 
along /, if x G E and y = f(x), we have u>i\y(x) = 1. 
Thus, by sharpness (Axiom 3), u>i\y = S x . Hence, 
a = Y^ x eE^(f( x ))^x- It follows that uj 2 (f(x)) = a(x), 
giving us (4). □ 

Lemma 3: rj~A is an isomorphism state. 

Proof: We need to show that rj2 ■ E(A) — > V(A) is a 
linear isomorphism with a positive inverse. It is enough 



to show that r\A maps the positive cone of E(A) onto 
that of V(A). Since x H> x is an isomorphism between 
A and A, we can apply Lemma 2 to A: if a G V + (A), 
we have a = J2 X £E a (%)o~-- Since t\a (x, x) = 1/n, 
we have t]a{x) = —8% for every x £ X(A). Hence, 

VA{J2 X £E na ( x ) x ) =oc. U 

Lemma 4: Every a € E(A) has a representation 
a = ^2 x£E t x x for some test E G A4(A) and some 
coefficients t x . 

Proof :li a G E(A) + , then by Lemma 2, ?7A(a) = 
YlxeF^x^x for some E G M(A). By Lemma 3, r\A is 
an order- isomorphism. Applying 77^ 1 to this expan- 
sion gives the desired result. Now for an arbitrary 
a G E(A) + , we can find some N such that a < Nu. If 
a = ai — a 2 with ai,a 2 G E(A) + , we can find N > 
with a 2 < Nu. Thus, b := a + Nu = a\ + (Nu - a 2 ) > 0. 
Then b := YIxge^x for some E G M, so that 

a = b-Nu = T, x£ E t ^ x - N iT, X EE x ) = J2 x£ e(^^ N ) x - 
□ 

Lemma 5: The bilinear form (a,b) = r]A{a,^A{b)) is 
positive-definite, i.e., an inner product. 

Proof: By assumption, it's symmetric. It's also bilinear, 
because t\a is non-signaling and ja is linear. We need to 
show that ( , ) is positive semi-definite. Let a G E(A). 
From Lemma 4, we have a = ^2 x€E t x x for some test E 
and some coefficients t x . Now 

(a,a) = (^t x x,^t y y\ = ^ t x t y (x,y) 

\xeE yeE I x,y£ExE 

txt y VA(x,y) 

x,y£ExE 

= -£t, 2 >o. 

x£E 

This is zero only where all coefficients t x are zero, i.e., 
only for a = 0. □ 

Lemma 2 tells us that E(A) ~ ~V(A), so it remains 
only to show that the inner product ( , ) is self- 
dualizing. Clearly E(A) + C ~E(A) + , since r](a,b) > 
for all a, b G E(A) + . For the reverse inclusion, suppose 
(a, b) > for all b G E(A) + . Then (a,y) > for all 
y G X. By Lemma 4, a = ^2 x£E t x x for some test E. 
Thus, for all y E E we have (a, y) = t y > 0, whence, 
a G E(A)+. □ 

Axiom 4 now guarantees then ~V(A) and hence, E(A), 
is homogeneous. Indeed, as discussed above, a reversible 
process is an order- automorphism <j> : ~V(A) — > V(A); 
to say that this prepares a, up to normalization, from 
p, is simply to say that a = t<fi(p) for some appropriate 
constant t > (namely, t = ua(4>(p))~ 1 )- Since t(f> is 
again an order-automorphism, it follows that the group 
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of order-automorphisms of V(A) act transitively on the 
interior of V(A) + , i.e., V(A) is homogeneous. 

We now have the advertised result: Subject to axioms 
1-4, E(A) is homogeneous and self-dual, whence, by 
the Koccher-Vinberg Theorem, can be equipped with a 
euclidean Jordan structure making E(A) + the cone of 
squares. Indeed, this Jordan structure is unique, subject 
to the condition that ua act as the unit for this Jordan 
structure. (One can further show that the outcome-set 
X(A) is precisely the set of primitive idempotents in 
E(A) with respect to this Jordan structure, and that ba- 
sic measurements in A4(A) correspond to Jordan frames, 
i.e., maximal pairwise orthogonal sets of idempotents. 
See Lemma 5 in Q.) 

Remarks (1) Note that the only point at which Axioms 2 
and 3 were invoked was in the proof of Lemma 2. Thus, 
any other assumptions leading to the representation (4) 
could be used instead. Moreover, since t}a is a linear 
isomorphism, and hence (in finite dimensions) an homc- 
omorphism, it is enough to obtain this representation for 
states in the interior of Q. From this we have, as in the 
proof of Lemma 3, that the interior of the cone V + is in 
the range of t}a, from which it follows that rj is a linear 
isomorphism, and hence, that K := ij _1 (V° ) is an open 
sub-cone of E + , spanning the latter. Moreover, every 
point in the interior of K has a corresponding "spectral" 
decomposition of the form ^2 xeE t x x with t x > 0, where 
E is some test in M(A). Arguing exactly as in the proof 
of Lemma 4, we can extend this to arbitrary points a 6 E 
by decomposing a as ai — a^, where oi, a<i £ K. The rest 
of the proof of Theorem 1 then proceeds just as before. 

(2) Since axioms 1 and 2 have such a similar char- 
acter, it is natural to look for a single principle that 
encompasses them both. Suppose G is a group acting 
transitively on the outcome-space X(A) of the model 
A, and leaving the state-space Sl(A) invariant. If G is 
compact, there will exist an invariant state, p, obtained 
by group averging; by the transitivity of G on outcomes, 
this state must be constant, i.e., p is the maximally 
mixed state p(x) = l/n. Now consider the following 
variant of Axiom 2 (here G a denotes the stabilizer in G 
of the state a): 

Axiom 2' There exists a system A and an isomorphism 
7^4 : A ~ A, such that every state a is the marginal, 
uj\, of some correlating bipartite state u) on AB with 
uigXtJAigy)) = u{x,^ A {y)) for every g £ G a . 

As observed in Section 2, this is satisfied by finite- 
dimensinal quantum models. Applied to the maximally 
mixed state p, this produces a correlator t\a turning A 
into a conjugate in the sense of Definition 3. Thus, we 
have 

Corollary 2: Let A carry a compact, transitive- on- 
outcomes group action, as described above, and satisfy 
Axioms 2' and 3. Then E(A) is self-dual. 



(3) Given Axioms 1-3, any condition guaranteeing the 
homogeneity of V(A) will also secure that of E(A). As 
observed in Q, homogeneity follows from the require- 
ment that every interior state of A be the marginal of 
an isomorphism state on a composite of two isomorphic 
copies of A. Thus, in place of Axiom 4, we could simply 
strengthen Axiom 2 to 

Axiom 2" Every interior state of A is the marginal of 
an isomorphism state on AA, correlating some pair of 
tests. 

Corollary 3: For any model A satisfying Axioms 1, 2" 
and 3, E(A) is homogeneous and self- dual. 

For irreducible systems, isomorphism states are pure 
[3 .so Axiom 2" is related to the purification postulate 
of Q . The latter asserts that every system A has a "con- 
jugate system" (in their usage) B, such that every state 
on A arises as the marginal of a pure state of AB, unique 
up to symmetries of B. 

IV. FILTERS 

To this point, I've been leaning heavily on the assump- 
tion of sharpness (Axiom 3). It would surely be prefer- 
able to define 5 X to be the conditional state r]i\ x , for each 
x £ X(A), and to prove that this is the unique state 
making x certain. In fact, this can be done if we replace 
Axiom 2 with a slightly stronger, but very plausible, ax- 
iom concerning the existence of certain processes called 
filters [13 . In many kinds of laboratory experiments, the 
distinct outcomes of an experiment correspond to phys- 
ical detectors, the efficiency of which can independently 
be attenuated, if desired, by the experimenter. 

In fact, this can be done reversibly. Let A be a 
finite-dimensional quantum system, with corresponding 
Hilbert space H, and identify E(A) with L/^?^). If 
E is an orthonormal basis representing a basic mea- 
surement on this system, define a positive self-adjoint 

l ji 

operator V : H — > H by setting Vx = t x x for 
every x £ E, where < t x < 1. This gives us a 
completely positive linear mapping (j> : E(A) — > E(A), 
namely 4>(a) = VaV. This has a completely positive 
inverse </> -1 (a) = V~ 1 aV~ 1 , hence, is an order auto- 
morphism. For each x £ E, the corresponding effect 
x £ E(A) ~ Lh(H) is the projection operator P x . It is 
easy to check that VP X V = t x P x , i.e., that <p(x) = t x x 
for every x £ E. 

Definition 5: A filter on a probabilistic model A is a 
positive linear mapping $ : V(A) — s- V(A) such that, 
for some test E G A4(A) and coefficients < t x < 1, 
$(a)(a;) = t x a(x) for all x € E and all states a. 

Let us say that (f> prepares a state a if a equals 4>(p) up 
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to normalization, where where p is the maximally mixed 
state p(x) = 1/n. If every interior state is preparablc by 
a reversible filter, then V(A)+ is homogeneous. Suppose 
that A has a conjugate system, A, and that (f> is a filter 
for a test e .M(A). By applying <fi to one wing of 
the composite system AA 7 we can convert the correlator 
rjA into a new non-signaling, sub-normalized joint state 
w, given by w(x,y) = n A (4>*x,y) for all x e X(A),y g 
X(B). Noticing that $*(z) = i^x for every x € E, we 
see that w correlates -E 1 with £7: ii x,y € E with x ^ y, 
we have uj(x,y) = r) A (t x x,y) = t x T) A (x,y) = 0. Since 
Wi = p o 0, it follows that any state preparable from p 
by a filter, is the marginal of a correlating state. So, in 
the presence of sharpness we can replace Axiom 2 by the 
axiom that every state be preparable by a filter. In fact, 
we can do a bit better. 

The isomorphism ja ■ A ~ A extends to an order- 
automorphism *V(A) ~ V(A), given by a i— > a, with 
a(x) = a(x) for all x g X(A). Hence, a positive 
linear mapping $ : V(A) — > V(A) has a countepart 
¥ : V(A) V(A), given by ¥(a) = $(a). Let us say 
that $ is symmetric with respect to a conjugate A iff 
= VA(x,W(y)) for all x,y g X(A), i.e., iff 

7]A O ($ (g 1) = T] A O (1 (g) $). 

Lemma 6: Let A have a conjugate A. Suppose that 
every state of A is preparable by a symmetric filter. 
Then (a,b) := ^(0,7^(6)) is a self-dualizing inner 
product on 'E(A). 

Proof: Let a = &(p), where $ is a symmetric filter on 
some test E. Consider the bipartite state 

u := = (1®*)(»M). 

For each outcome x £ X(A), let S x denote the conditional 
state {r]A)i\x- For all x g E, we have 



wife(y) 



7? A ($*( ? /),Ir) 
?7a($*(ua),^) 



?M(y,i>*Mx)) 
?7a(ma,$*(x)) 
VA{y,t x x) 

n A {uA,txx) 



Va(v,x 



= (Va)i\x = &x 



t] A {ua,x) 

Now ui\ = $((t]a)i) = $(p) = c>, and also, by the 
law of total probability (1), u)\ = Y^xeE ^2(^)^115 = 
Y^xeE tx^xi where t x = uj 2 (x). Thus, every state on A 
is a convex combination of the states 8 X . Hence, the 
cone generated by these states coincides with V(A)+. 
We now have that rj maps E(^4) + onto ~V(A) + , as in 
the proof of Lemma 3. The proof that (a, b) := n(a, b) 
defines an inner product on E(A) now proceeds as in the 
proof of Lemmas 4 and 5. □ 

This suggests another axiom, combining Axiom 1 with 
a strengthened form of Axiom 4: 



Axiom 4b A has a conjugate system, and every interior 
state is preparable by a reversible symmetric filter. 

This clearly implies the homogeneity of ~V(A). In fact, 
it is strong enough to allow us to do without Axioms 
2 and 3. Indeed, as noted in Remark (1) following the 
proof of Theorem 1, it is sufficient to obtain the decom- 
position (4) for points in the interior of V + . By Lemma 
6, for any system satisfying Axiom 5, all states in the 
interior of Q can be decomposed as in equation (4); as 
noted in Remark (1) following the proof of Theorem 1, 
this is enough to secure the self-duality of E(A), and its 
isomorphism with V(A). This proves Theorem 2. 



V. CONCLUSION 

We've seen that any of several related sets of assump- 
tions, e.g., Axioms 1-4, or Axioms 2', 3 and 4, pr Axioms 
1, 2' and 3, or the two-part Axiom 4b, lead in a very 
simple way the homogeneity and self-duality of the cone 
E(j4) + associated with a probabilistic model A. Hence, 
by the Koecher-Vinberg Theorem, the space E(A) carries 
a canonical Jordan struture. While this is not the only 
route one can take to deriving this structure (see, e.g, |l5| 
and fl8j for approaches stressing symmetry principles), 
it does seem especially straightforward. 

Among Jordan-algebraic probabilistic theories, finite- 
dimensional quantum mechanics over C can be singled 
out as follows. A non-signaling composite system AB 
locally tomographic iff every state u> G AB is uniquely 
determined by the joint probability function ui(x, y) that 
it induces. It is well known, and easy to see on dimen- 
sional grounds, that among finite-dimensional real, com- 
plex and quatcrnionic quantum mechanics, only the com- 
plex version is locally tomographic. Call a probabilistic 
theory monoidal iff it is a symmetric monoidal category, 
in which the monoidal product is a non-signaling com- 
posite in the sense of Definition 2 above. By exploiting 
a result of Hanche-Olsen [T3|, one can show [f| that a 
Jordan-algebraic theory in which all composites are lo- 
cally tomographic, and which contains at least one sys- 
tem having the structure of a qubit, must be a direct sum 
of finite-dimensional complex matrix algebras — that is, 
finite-dimensional complex QM with superselection rules. 
One should perhaps not rush to embrace local tomogra- 
phy as a universal principle, however. Indeed, the very 
fact that it excludes real and quaternionic quantum the- 
ory suggests that it is too strong: see [|[ for some cogent 
reasons not to exclude these cases. 

Several other recent papers (e.g, d, d, [H, [ID, HU) 
have derived standard finite-dimensional quantum me- 
chanics, over C, from operational axioms. Besides the 
fact that the mathematical development here is much 
quicker and easier (modulo invocation of the KV the- 
orem), the axiomatic basis is different, and arguably 
leaner. The papers cited in the introduction tend to im- 
pose strong constraints on "subspaces" , along the lines 
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of assuming that every face of the state space corre- 
sponds to the state space of a system satisfying the re- 
maining axioms. A related assumption, also used in sev- 
eral of the cited papers, is that all systems characterized 
by the same "information-carrying capacity" are isomor- 
phic. The present approach entirely avoids such assump- 
tions. I also avoid the assumption, used in [T3, EH that 
every element of \(A)* corresponds to a physically ac- 
cessible measurement result. Finally, all of the cited pa- 
pers assume some form of local tomography. In view of 
the comments above, it seems valuable to be able to de- 
lineate clearly what does and what does not depend on 
this assumption (particularly if we are interested in the 
possibilities for a "post-quantum" theory). 

This brings us to the interesting question of whether 
all euclidean Jordan algebras actually satisfy the axioms 
discussed here. Let X denote the set of primitive (that 
is, minimal) idempotcnts in E, let A4 denote the set of 
Jordan frames (i.e., maximal sets of pairwise orthogonal 
idempotents) , and let fi denote the set of states on E 
(i.e., positive linear functionals a £ E* with a(u) = 1, 
where u is the unit element of E). Then A = (A4, 51) is 
a probabilistic model with E(^4) = E. In particular, A 
is self-dual and sharp. Taking 77^(0, &) = ^Tr(ab), where 
r is the rank of E, we have a perfectly correlating, non- 
signaling bipartite state. Using the spectral theorem for 
Jordan algebras, plus the quadratic representation [Ioj |. 
one can show that every state of A can be prepared by a 
filter. Hence, every state is the marginal of a correlating 
bipartite state, as discussed in Section 4. 

What isn't obvious is how to interpret the state r\ just 



defined. In orthodox QM, in fact, it isn't a state at 
all: rather, one needs to invoke the complex conjugate 
Hilbcrt space. What meaning one should attach to r\A 
must depend on the choice of the conjugate system A, 
and on that of the composite system AA — - and this, 
in turn, depends, not on the individual model A, but on 
the entire probabilistic theory at hand. So the question 
remains: can we embed arbitrary euclidean Jordan alge- 
bras in a probabilistic theory in which axioms 1, 2' and 
3 are satisfied? 

Even assuming that this is possible mathematically, 
there is still a question of how to interpret the conjugate 
system A. In quantum theory, one can regard the con- 
jugate Hilbert space H as representing a time-reversed 
version of the system represented by H. Whether some 
such interpretation can be maintained more generally 
remains to be addressed. Alternatively, one might view 
the existence of a conjugate as a way of formulating 
von Neumann's "projection postulate": if we take A to 
represent the system at one moment, and A, the system 
"immediately afterwards" , then tja represents a state in 
which we expect that, whatever measurement is made, 
and whatever result is secured, if thai same measurement 
were immediately repeated, the result would be the same. 
In any case, the meaning of the conjugate, and of the 
correlator t\a, must certainly depend on the entire proba- 
bilistic theory. These matters will be taken up elsewhere. 
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